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ABSTRACT 


Conditional  on  the  value  of  6,  6 > 0,  let  X(t),  t > 0.  be  a homogeneous 


Poisson  process.  Sequential  estimation  procedures  of  the  form  (o,9(a))  are 


considered.  To  measure  loss  due  to  estimation,  a family  of  functions,  indexed 


and  the  cost  of  sampling  involves  cost 


per  arrival  and  cost  per  unit  time.  The  notion  of  "monotone  case"  for  total 


cost  functions  of  a continuous  time  process  is  defined  in  terms  of  the  characteristic 
operator  of  the  process  at  the  total  cost  function.  The  Bayes*  sequential  pro- 
cedure is  then  derived  for  those  cost  functions  in  the  monotone  case  using 
extensions  of  Dynkin's  identity  for  the  characteristic  operator.  The  properties 
of  these  procedures  are  studied  as  sampling  costs  tend  to  0,  and  the  procedures 
are  then  modified  and  compared  with  procedures  which  are  optimal  among  all 
stopping  rules  terminating  at  arrivals. 
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SIGNIFICANCE  AND  EXPLANATION 


When  trying  to  explain  or  analyze  events  that  occur  randomly  in  time  or 
in  space,  one  tends  first  to  test  whether  the  events  are  governed  by  a Poisson 
distribution.  The  classic  example  concerns  the  number  of  soldiers  killed  per 
year  from  the  kick  of  a mule  in  the  Prussian  Army  in  the  early  1800's,  and 
applications  have  continued  to  this  day  in  very  many  different  contexts,  military 
and  otherwise. 

Usually,  the  mean  arrival  or  occurrence  rate,  of  a Poisson  process 

is  unknown.  This  paper  derives  optimal  procedures  for  estimating  the  value 
of  6,  The  class  of  procedures  considered  xs  "sequential"  in  the  sense  that  nei- 
ther the  length  of  time  the  Poisson  process  is  observed  nor  the  number  of 
arrivals  observed  are  fixed  in  advance.  Instead,  the  outcome  of  the  process 
prior  to  any  point  in  time  may  be  used  to  decide  whether  to  continue  observing 
the  process  beyond  that  time. 


< 


The  responsibility  for  the  wording  and  views  expressed  in  this  descriptive  summary 
lies  with  MRC,  and  not  with  the  authors  of  this  report. 


Dynkin's  Identity  Applied  to 
Bayes'  Sequential  Estimation 
of  a Poisson  Process 
by 

C.P.  Shapiro  and  Robert  L.  Wardrop^ 

Michigan  State  University  and  University  of  Wisconsin-Madison 

1.  Introduction.  Conditional  on  the  value  of  6,  6 > 0,  suppose  that  X(t), 
t ^ 0,  is  a homogeneous  Poisson  process.  Sequential  estimation  procedures  of 

A 

the  form  (a,9(o))  will  be  considered  where  a is  a stopping  time  with  respect 
to  t ^ 0),  with  the  sigma  algebra  of  events  generated  by  {X(s),  0 £ s it), 
and  e(o)  is  an  ^(o)  measureable  function,  with  ^(o)  the  sigma  algebra  of 
events  prior  to  a. 

A family  of  functions,  indexed  byp,  0£p<3,  will  measure  loss  due  to 
estimation: 

(l-l)  Lp(0,0)  = e'P(0-0)^. 

The  effect  of  the  p is  to  measure  loss  in  units  of  0'^.  The  case  of  p » 0, 
squared  error  loss,  has  received  the  usual  attention.  Dvoretsky,  Kiefer,  and 
Wolfowitz  (1953),  and  Hodges  and  Lehman  (1951)  have  suggested  p = 1.  Using  p ~ 2 
causes  the  decision  problem  to  be  invariant  under  the  group  of  scale  trans- 
formations (Ferguson,  1967).  The  restriction  of  p to  [0,3]  is  for  mathematical 

^ This  research  was  sponsored,  in  part,  by  the  United  States  Army  under  Contract 
No.  DAAG29-75-C-0024. 
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tractibility,  and  the  inclusion  of  noninteger  p values  increases  the  com- 
putational difficulty  only  slightly. 

The  cost  of  sampling  will  involve  two  components:  c^,  the  cost  of  ob- 
serving one  arrival,  and  Cj,  the  cost  of  observing  the  process  for  one  unit 
of  time. 

In  section  2,  the  notion  of  "monotone  case"  for  continuous  time  problems 
is  defined  in  terms  of  the  characteristic  operator  of  X(t)  at  the  total  cost 
function.  Then  with  a gamma  prior  distribution  on  9,  the  Bayes'  sequential 
procedure  is  derived  using  extensions  of  Dynkin's  identity  when  the  total  cost 
is  in  the  monotone  case.  As  will  be  seen  in  the  section,  two  often  used 
stopping  times  (types  I and  II  censoring)  arise  as  Bayes'  procedures  with  the 
appropriate  choice  of  p and  the  costs. 

In  Section  3,  the  large  sample  properties  of  the  Bayes'  sequential  pro- 
cedures are  studied  when  only  one  cost  is  positive. 

In  Section  4,  the  class  of  rules  to  be  considered  is  restricted  to  those 
stopping  at  an  arrival,  and  the  additional  cost  incurred  by  using  a modifi- 
cation of  the  continuous  time  Bayes'  sequential  procedure  rather  than  the  optimal 
rule  in  this  restricted  class  is  seen  to  be  no  larger  than  p - 2 for  p = 0,1,2. 
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2.  The  Bayes'  sequential  procedures.  Henceforth,  assume  0 has  prior  distri- 

-1  i 

bution  Ap(e)  = 0 ° e , where  > 0»  “q  ^ P»  denote  this  j 

distribution  gamma  With  this  prior,  the  posterior  distribution  of  6 I 

given  ^9{t)  is  gamma  (a^,6^),  where  ^ X(t)  and  ♦ t.  Using  loss  i 

Lp  defined  in  (l.i),  the  posterior  expected  loss  is  | 

I 

(2.1)  E[Lp(e,0)|/y(t)]  * r(a^+l-p)r(a^)"’, 

and  the  Bayes'  estimator  of  0 given  /9(t)  is 

(2.2)  0p(t)  = (a^-p)S'’. 

The  total  cost  of  observing  the  process  for  t units  of  time  is 

(2.3)  ^p(t,x(t))  = BP‘^r(a^+l-p)r(a^)'^  + C^X(t)  + c^t. 

The  Bayes'  sequential  procedure  (BSP)  minimizes  E^'p(t,X{t))  over  all  pairs 

A 

(o,0p(o))  where  a is  a stopping  time  with  respect  to  t > 0}  and  0p(o)  is 

an  estimator  of  0,  and  ^ (o)  measurable.  Since  the  Poisson  process  is  strong 
Markov,  only  estimators  which  are  Bayes'  given  ^{a)  need  be  considered  (namely,  (2.2V 

with  t replaced  by  a). 

Candidates  for  the  BSP  will  be  infinitesimal  look-ahead  rules  derived  from 
the  characteristic  operator  of  X(t)  at  <^’p(t,x).  Such  rules  can  be  shown  to  be 
Bayes'  if  the  characteristic  operator  changes  sign  only  once  and  if  the  expected 

A 

cost  of  using  procedure  (a,ep(o))  can  be  expressed  in  terms  of  the  operator  for 
a large  class  of  stopping  times  o. 


H 

! 
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2 

Let  f(t,x)  be  measurable  and  finite  on  [0,*)  . Define 
(2.4)  D^(t.x.h)  - Ei:^(t^h.X(t>h))j(t.X(tJ)_»_(t,)01-__fit^ 

and  consider  only  those  functions  f such  that  Df(t.x,h)  is  uniformly  bounded 
in  h > 0 and  the  limit  of  exists  as  h decreases  to  zero.  For  such  f de- 
fine the  characteristic  operator  at  f as 


1 

I 


(2.5)  Af(t,x)  * lim  D^(t,x,h). 

hiO 

Of  particular  interest  is  the  characteristic  operator  at  f^(t,x)  which  is  given 
in  the  lemma  below. 

Lemma  2.1.  A 4jj(t,x)  = -6^"^r(aQ+x-p+l  )r(aQ+x)"^  + ^A^^o^’^^^t^  * 

Proof.  See  appendix. 

Say  that  cost  tfp  is  in  the  monotone  case  if  A ^p(t,X(t))  changes  sign  at  most 
once.  Before  stating  when  is  in  the  monotone  case,  a result  about  the  ratio 
of  gamma  functions  is  needed. 

Lemma  2.2.  Suppose  w > 0.  Then  r(w-b)r(w)’^  is  increasing  in  w for  b < 0 
and  decreasing  in  w for  0 < b < w. 

Proof.  Take  b > 0,  and  note  that  r(w-b)r(w)"^  is  decreasing  in  w if  and  only 
if  log  r(w)  is  convex.  Thus,  it  suffices  to  show  that  the  log  r(w)  is  convex. 
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Fix  0 < a < 1,  and  w^,  w^.  Then  [a  logr(w^)  + (l-a)  logr(w2)]  > logr(aw^  ♦ (l-alw^) 

if  and  only  if  r(w^ )®r{w2)^'^  > r(aw^  + (l-a)*^^).  But  r(aw^  ♦ (l-a)*^) 

aw,  + (l-a)w~-l 
= /X  ' dx 

aw,-a  (l-a)w--(l-a) 

* /x  X dut  du  * ■»"*dx, 

aw, -a  {l-a)wp+{l-a) 

< ||x  lli/all’^  lh/{l-a)’  Holder’s  inequality. 

This  last  expression  is  equal  to  r(w^)®  r(w2)^"*.  . 

I 

I 

Lemma  2.3.  In  each  of  the  cases  (i)  - (iii)  below,  is  in  the  monotone  case:  ' 

(i)  Cj  = 0,  c^  > 0,  0 < p < 1; 

(ii)  1 < p < 2;  ! 

(iii)  c^  = 0,  c.|.  > 0,  2 < p < 3. 


I 


1 


I 

1 


Proof.  For  (i),  A 'j^p  > 0 if  and  only  if  c^r(a^+l )r(a^-p+l )'^  > 6^”^.  Since 
the  rhs  is  decreasing  in  t and  the  Ihs  is  nondecreasing  in  t,  (i)  is  in  the 
monotone  case.  For  (ii),  A jj'p  > 0 iff  c^a^  + c^e^  > 6j‘^r{a^-(p-l))r(o^)‘\ 
where  the  rhs  is  decreasing  in  t and  the  Ihs  is  increasing  in  t.  For  (iii), 
A )?p  > 0 iff  CyB^  ^ > r(a^-p+l )r(a^)”\  with  the  rhs  nonincreasing  in  t and 
the  Ihs  increasing  in  t. 

From  now  on,  only  those  tf’p  in  the  monotone  case  will  be  considered. 

For  those  p,  c,^,  and  Cj  such  that  <fp  is  in  the  monotone  case,  define 

the  stopping  time 

(2.6)  Xp  = first  t > 0 such  that  A ^p(t,X(t))  > 0 


1^ 


I 
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Another  property  of  ratios  of  gamma  functions  is  needed  to  derive  bounds 
on  Tp  and  X(Tp). 

Lenina  2.4. 

If  w > p ^ 0,  then  r{w-p)r(w)”^  > w“*^. 

If  0 < p < 1 and  w > p + 1,  then  r(w-p)r(w)”^  £ (w-(p+l ))”**. 

If  p > 1 and  w > p,  then  r(w-p)r(w)"^  < (w-p)"^. 

Proof.  Let  a be  in  [0,«)  and  define  h(w,a)  * (w-a)^  r(w-p)r(w)”^t  for  w > a.  Then 
h(w,a)  < h(w+l,a)  if  and  only  if 

(2.7)  log  w + p log(w-a)  <.  p log  (w+l-a)  + log(w-p). 

If  p > 1,  then  using  the  concavity  of  logs,  (2.7)  holds  with  a « p;  thus, 
h(w,p)  < h(w+l,p).  If  p < 1,  then  (2.7)  holds  with  a » p + 1 , and  hence 

h(w,p+l)  < h(w+l,p+l).  Noting  that  h(w,a)  -*•  1 as  w * for  each  a yields  the 

upper  bounds  in  the  lemma.  Choosing  a » 0 reverses  the  inequality  in  (2.7)  and 

gives  the  lawer  bound,  with  only  w > p required. 

Lemma  2.5.  For  the  cases  of  Lemma  2.3,  the  following  bounds  on  Tp  and  X(Tp)  hold: 


1 

0 < p < 2, 

(ii) 

1 < p < 3, 

' (iii) 

0 < p < 2, 

X(y  <c-'/p 

1 

1 < p £ 3, 

X(Tp)  < s^3)/(P-') 

where  the  bounds  are  infinite  if  the  cost  involved  is  zero. 


I 


Proof.  For  (i),  A f p < 0 implies  that 

< c’^  r(a^+l-p)r(a^+l)*’ 


This  implies  the  bound  claimed,  (ii)  is  proven  in  a similar  fashion. 

For  (Hi),  A l^p  < 0 implies  that 

c^r(a^+l)  < 6P‘^r(a^+l-p) 

< BP'^r(a^+l-p). 

Next,  if  p > 1,  then 

c^B^'P  < r(a^+l-p)r(a^+l)’^ 

1 (ot^+l-p)'P  by  Lemma  2.4. 

If  0 < p < 1,  then  c^Bq^'P^  < that  = X(t)  + 

yields  (iii).  Part  (iv)  is  proven  by  a similar  argument. 

I To  deduce  the  optimality  of  rules  Tp,  E^p(o,X(a))  will  be  expressed  in 

terms  of  A Referring  back  to  f(t,x)  and  Af(t,x)  defined  earlier  in  the 
section,  let  o denote  a stopping  time  with  Eo  < ®.  Also  note  that  X(t)  is 
marginally  a strong  Markov  process.  If  f is  bounded  and  f(t,X(t))  is  a.s. 

right  continuous  in  t,  then  the  following  identity  due  to  Dynkin  is  well  known: 

il: 

•), 

t o 

(2.8)  Ef(o,X(o))  = E f Af(t,X(t))  dt  + f(0,0). 
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For  details,  see  Athreya  and  Kurtz  (1973).  In  the  present  situation  ftp(t,x) 
is  not  a bounded  function,  and  hence  some  extension  of  (2.8)  which  will  include 
i^p(t,x)  is  needed. 

Lemma  2.6.  Suppose  f is  nonnegative  and  continuous  in  (t,x).  Let  o be  a 

o 

stopping  time  such  that  E / |Af(t,X(t))|dt  < *. 

0 

(i)  If  EX(o)  < <»  and  if  f is  nonincreasing  in  t and  nondecreasing  in  x such 
that  Ef(0,X(a))  < »,  then  (2.8)  holds. 

(ii)  If  Eo  < * and  if  f is  nondecreasing  in  t and  nonincreasing  in  x 
such  that  Ef(o,0)  < ®,  then  (2.8)  holds. 

Proof.  (i)  Define  the  following  sets  of  stopping  times: 

a 

S*  = [a:  EX(a)  < »,  Ef(0,X(o))  < »,  E / |Af (t,X(t)) |dt  < «), 

0 

S|  = (o:  o is  in  and  Eo  < *}, 

= {a:  a is  in  and  X(o)  is  bounded}. 

First,  (2.8),  will  be  shown  to  hold  for  a in  S|,  then  for  a in  S|,  and 
finally  for  o in  S|. 

Let  o be  in  S|.  Then  there  exists  m < » such  that  X(a)  < m.  Define 
f^(t,x)  = f(t,x)  on  [0,  <»)  X [0,m],  and  = f(t,m)  otherwise.  Then  (2.8)  holds 

for  fj^  and  a.  But  X(o)  < m implies  that  f^^  can  be  replaced  by  f in  the  ex- 

pression of  (2.8).  Thus,  (2.8)  holds  for  all  a in  S|. 

Now  take  o in  S|,  and  sequence  mj^  + «>.  Define  stopping  times 
0|^  = o if  X(a)  < mj^  and  = tj^  if  X(a)  _>  mj^,  where  tj^  is  the  first  time  that 

X(t)  = mj^.  Then  Oj^  is  in  S|  and  (2.8)  holds  for  f and  Oj^.  However, 

Ef(a|^,X(a|^))  = Ef(a,X(o))  [X(a)  < m|^]  + Ef (t|^,X(t|^) ) [X(a)  The  first 
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tem  tends  to  Ef(o,X(o))  and  the  second  term  tends  to  0 since  f(o,X(a))  and 
^(t|^,X(t|^) ) are  both  bounded  by  f (0,X(o) ), which  is  integrable, and  EX(o)  < “>. 

^ 0 

In  like  manner  E / Af(t,X(t))  dt  = E / Af{t,X(t))  dt  [X(o)  < m. ] 

.0  0 

a 

+ E / Af(t,X(t))  dt  [X{o)  > m.],  where  the  first  term  tends  to  E / Af(t,X(t))dt 
0 0 

a 

and  the  second  term  is  bounded  by  E / |Af|[X(o)  > m. ] which  tends  to  0. 

0 *• 

Now  take  o in  S^,  and  sequence  tj^  f The  truncation  is  now  done  on  o. 

Define  = min(o,t|^).  Then  0|^  is  in  for  all  k and  (2.8)  holds  for  Oj^  and  f 

As  before,  Ef(0|^,X(a|^))  = Ef(o,X(o))[o  1 tj^]  + Ef (t(^,X(t|^) )[o  > tj^].  Note  that  o 

in  implies  EX(a)  < and  hence  P(o  < ®)  = 1.  Using  this  and  arguments  similar 

to  those  above  imply  that  Ef(0|^,X(0|^))  tends  to  Ef(o,X(a)).  Finally,  using 

°k  a 

similar  arguments,  E / Af(t,X(t))dt  can  be  shown  to  tend  to  E / Af(t,X(t))dt. 

0 0 

(ii)  The  proof  is  parallel  to  that  of  (i)  with  the  roles  of  X(a)  and  o reversed 

The  lemma  below  makes  precise  the  roles  of  the  characteristic  operator  and 

Dynkin's  identity  in  proving  the  optimality  of  Xp. 


Lemma  2.7.  Consider  only  those  Jp  in  the  monotone  case,  and  define  Sp  = {o: 
either  (2.8)  holds  with  f(t,x)  = T^(t,x)  or  E ^p(a,X(a))  = “}.  If  Xp  is  in  Sp 
and  E irp(Xp,X(Xp))  < »,  then  E £^p(Xp,X(Xp) ) < E lfp(a,X(o)) , for  all  a in  Sp. 

Proof.  Let  a be  a stopping  time  such  that  (2.8)  holds  and  E^p(o,X(a))  is 
finite,  and  recall  the  definition  of  Xp.  Then 
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o P 

E r;(a,X(o))  - E i^.(T^.X(T J)  = E / A - E / 

K hkp  0^0^ 

0 

= E[o  > T ] / AC'  - E[o  < T ] / ACT 

H Tp  0 

which  is  nonnegative  by  definition  of  and  the  monotone  property  of  C’p. 

Theorem  2.1 . Suppose  is  in  the  monotone  case.  Then  Tp  is  optimal  in  that 
E 5’p('rp.X(Tp) ) < E C’p(a,X(o))  for  all  stopping  times  o. 

Proof.  In  each  of  the  cases  (i)-(iii)  of  Lemma  2.3,  the  class  Sp  defined  in 

Lemma  2.7  will  be  shown  to  include  all  stopping  times  o.  (i)  In  this  case, 

^p(t,x)  is  decreasing  in  t and  increasing  in  x.  Suppose  o is  a stopping 

time  with  EX(a)  < «=  and  note  that  if  EX(a)  = <»  then  the  expected  cost  is  infinite. 

Next,  ifp(0,x)  is  bounded  above  by  («Q+x+l-p)^”^  + c^x  using  the  lower 

bound  in  Lemma  2.4.  Thus,  EX(a)  < «>  implies  that  E ^(0,X(a))  < «>.  In  like 
o ^ 

manner,  E f |A  jf  J < ®,  and  Lemma  2.6  (i)  implies  that  S contains  all  o.  Part 
0 ^ ^ 

(i)  is  completed  by  noting  that  E Sp(Tp,X(Tp))  < ». 

For  case  (ii),  first  note  that  E ^(Tp,X(Tp))  < ».  The  cost  is 

?^Jj(t,x)  = (BQ+t)P"^r(aQ+x+l-p)r(aQ+x)"^  + c^x  + Cjt,  where  the  first  term  is 

decreasing  in  both  t and  x.  Using  techniques  similar  to  those  in  Lemma 

2.6,  it  can  be  shown  that  (2.8)  holds  with  f{t,x)  = (BQ+t)^'^r(aQ+x+l-p)r(aQ+x)  ^ 

and  o a stopping  time  such  that  P(a  < >»)  = 1.  Then  Lemma  2.6  can  be  applied 

directly  to  the  last  two  terms  of  the  cost  depending  on  which  costs  are  nonzero 

to  yield  Sp  containing  all  a. 


For  (iii),  again  note  that  E C^p{Tp,X(Tp) ) < »,  and  that  the  cost  is  now 


increasing  in  t and  decreasing  in  x.  Consider  o with  Eo  < “>  since  the  ex- 


pected cost  is  infinite  if  Eo  = «>.  Then  E Aa,0)  < ® since  Eo  < <*>  implies  that 

r n 


.P-2< 


“>  for  2 < p < 3.  Also  lAr_|  is  bounded  which  implies  that  E / 


and  Lemma  2.6  (ii)  can  be  used  to  imply  that  contains  all  o. 

P 


If  either  Cj  = p = 0 or  c^  = 0 and  p = 1,  then  stopping  rule  t is  type  I 


censoring.  Note  that  the  X(Tp)  is  unbounded  for  this  censoring,  while  Xp  is  bounded. 


If  either  Cj  = 0 and  p = 2 or  c^  = 0 and  p = 3,  then  x is  type  II  censoring 


and  Xp  is  unbounded  while  X(Xp)  is  bounded.  For  all  other  Xp  both  X(xp)  and  Xp 


are  bounded  which  may  be  an  important  consideration  in  applications.  Finally, 


the  stopping  times  Xp  are  (computationally)  easy  to  use, especially  for  integer  p. 


3.  Large  sample  properties  of  x . In  this  section  the  Bayes'  sequential  pro- 


cedures (Xp,e(Tp)),  0 < p < 3,  are  examined  from  the  sampling  theory  perspective. 


The  parameter  0 is  considered  fixed  but  unknown  and  all  probabilities  and  ex- 


pectations are  conditional  on  6 and  denoted  by  Pp  and  Ep,  respectively. 


Attention  will  be  restricted  to  the  case  when  exactly  one  of  the  pair 
is  positive  and  the  other  is  zero.  Of  interest  is  the  large  sample  behavior  of 


the  procedures. 


Write  'r(p,c^,c.|.,aQ,BQ)  = Xp  to  make  explicit  the  dependence  of  Xp  on  the 
various  design  parameters.  It  is  easy  to  verify  that,  for  0 £ p 1 2, 
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i 

I 

I For  c > 0,  and  1 1 p £ 3,  define 

I (3.2)  t*  = t*(e,c)  = C-V2 

( 

Lemma  3.1. 

, (i)  Let  = 0,  Cj  = c > 0.  For  every  e > 0 and  1 £ p < 3, 

f. 

E 

I Pe^Tp/tJ  - II  > e]  £k  exp[c‘’''^D(c,e,0,p)], 

f 

! 

for  k a constant  and  D(c,e,0,p)  -►  D(e,0,p)  < 0 and  finite  as  c 0. 

(ii)  Let  Cy  = 0,  = c > 0.  For  every  e > 0 and  0 £ p £ 2, 

^0^IV^P+1  ■ ^ exp[c‘^^^  D(c,e,0,p+1)], 

for  k and  D given  in  (i). 

Proof,  (i)  The  result  is  easy  to  obtain  for  p = 1.  For  p > 1,  write  t = Tp, 
and  t*  = t*.  Then  RgClx/t*  - 1 1 > e]  equals 

(3.3)  Pg[T/t*  > 1 + e]  + Pg[T/t*  < 1 - e]. 


1 The  first  term  in  (3.3)  equals  Pq[t  > s]  with  s = t*b  and  b = 1 + e.  By  Lemma 

! 2.4.Pq[t  > s]  does  not  exceed 

I ® 
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(3.4)  Pg[X(s)  < (2-ajj)  + 

By  Bernstein's  inequality  for  0 < u < 1,  (3.4)  is  bounded  above  by 
2 - 1 /P  + 

e exp[c  '‘'D  (c,e,0,p,u)] 


with 


D'^(c,e,e,p,u)  = ub0^^"P^^^[(6^/s)^P"^^^^P"^^b^'^^^’P^  + (e"“  -l)u”^]. 

Note  that  (35/s)^P'^^/(P~^ ^ f 1 as  c >1^  0;  b^^^^'P^  < 1;  and  (e’'^  - l)u'^  - - 1 
as  u ->■  0.  Thus,  there  exists  uj  > 0 such  that  for  all  u,  0 < u < uj  D'^(c,e,0,p,u)  < 
for  all  c. 

The  second  term  in  (3.3)  equals  Pq(t  < s)  with  s = t*b  and  b = 1 - e.  A 
similar  argument  to  the  one  above  shows  that  P„(t  < s)  is  bounded  above  by 

ot  ^ 

e ° exp[c  ^^^D“(c,e,0,p,u)]  with 

D (c,£,0,p,u)  = ub0^^”P^'^^[(e^  - l)u“^  - (6j/s)^P"^^^^P“^^  b^^^^“P^]. 

Thus,  there  exists  u^  such  that  for  allu,  0<u^Uq,  D is  negative  for  all 
c < Cq  for  some  c^  > 0.  Let  u^  = min(uj,u’)  and  let  D(c.e,0,p)  = max[D''(c,e,0,p,Ujj), 

D“(c,e,0,p,UQ)]. 

(ii)  may  be  proven  by  the  above  argument  and  the  identity  (3.1). 
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Lemtna  3.2.  (i)  Let  = 0.  = c > 0.  For  1 < p < 3,  Xp/tJ  is  uniformly 

integrable  as  c -♦  0. 

(ii)  Let  Cj  = 0,  = c > 0.  For  0 < p < 2,  Tp/t*^^  is  uniformly  integrable 

as  c 0. 

Proof,  (i)  Let  = hp/t*-  Ij.  It  suffices  to  show  that  for  fixed  a > 0, 

/ Y dP„  tends  to  0 as  c -►  0.  By  Leirmas  2.5  and  3.1, 

[»c>a]  ' ” 

/ Y.  dP.  < 9(P-')/2['|.-1/2j(p-1)/(3-P)  exp[c'’^^D(c.a.e,p)] 

[Y,>a] 

which  tends  to  0 as  c 0. 

(ii)  can  be  proven  with  the  same  argument. 

Theorem  3.1. 

(i)  Let  c^  = 0,  Cy  = c > 0.  For  1 < p < 3, 

lim  T„/t!;  =1  (in  P„  probability)  and  lim  E„(t  /t*)  = 1. 
cM)  P P ® c->0  “ P P 

(ii)  Let  Cj  = 0,  c^  = c > 0.  For  0 £ p <.  2, 

lim  Tp/tJ^^  = 1 (in  Pq  probability)  and  lim  EQ(Tp/tJ^.^)  = 1. 

Proof,  (i)  The  first  limit  follows  immediately  from  Lemma  3.1.  If  1 < p < 3, 
then  the  second  limit  follows  from  Lemmas  3.1  and  3.2.  If  p = 3,  then  the  de- 
finition of  implies  that 


c“^^^  - < X(t3)  < + (3-cIq). 


1 

! 

and,  I 

•V 

V. 

I - (t5)-\e-'  i (t|)-’Eet3  < 1 * (t5)-'e-'(3-»„).  ( 

i' 

i; 

Noting  that  t|  -►  » as  c -►  0 completes  the  proof.  The  proof  of  (ii)  is  similar. 

Theorem  3.2. 

(i)  Let  = 0,  Cj  = c > 0.  For  1 < p < 3, 

c-0  0 P P P 

(ii)  Let  Cy  = 0,  = c > 0.  For  0 < p < 2, 

Proof:  (i)  Write  t = x^,  t*  = t*.  From  the  definition  of  r, 

(3.5)  c > 6P‘^r(a^+l-p)r(a^)"^ 

which  implies  that  c 1 ^ upper  bound  tends  to 

29O  p)/2  jjy  Theorem  3.1.  For  a lower  bound,  let  x - e be  the  time  of  the 
X(x)  - 1 St  arrival.  Then  the  reverse  inequality  of  (3.5)  is  satisfied  at  x - e 
and  - 1 to  yield 

c < 3P:Jr(cx^-p)r(a^-l)'\ 
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This  implies  that 

(3.6)  EgTj  i Ec'/\[l*£(6,-e)'’]‘”’[1*(l-p)(v'>'’l  * '’^V' 

The  function  inside  the  expectation  of  expression  (3.6)  is  uniformly  integrable 
and  tends  to  in  probability  by  Theorem  3.1  and  the  result  that  both  t 

and  a tend  to  ® as  c tends  to  0. 

T 

(ii)  First  note  that  E^XCt)  = c^'^^GEgT,  (by  Doob's  optional  stopping  theorem), 
which  tends  to  e(2-P)/2  Theorem  3.1.  Now,  from  the  definition  of  t, 

c\  > 6j"^r(a^+l-p)r(a^)’^ 


which  implies  that 


* c'/%X(T). 


This  upper  bound  tends  to  Using  the  same  techniques  as  in  (i)  yields 


EeC^''\-p)[6.r(B^-e)'^]P'^  + c^/%X(t) 

which  tends  to 

Once  the  limiting  form  of  t is  derived,  the  asymptotic  distribution  of 
e(T)  can  be  found  by  straightforward  methods. 
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Theorem  3.3.  If  either  = 0,  = c > 0 and  1 < p < 3,  or  Cj  » 0,  » c > 0, 

and  0 < p < 2,  then 

Tp^^(9(Tp)  - 6)6  •*  Z (in  distribution), 

as  c -►  0,  where  Z is  standard  normal. 


Proof.  The  result  is  an  extension  of  the  theorem  on  random  sums  of  Independent 
and  identically  distributed  random  variables  by  Renyi  (1957).  The  proof  given 
in  Shapiro  and  Wardrop  (1977)  may  be  used  here. 

Theorem  3.3  says  that  for  small  c,  6(Tp)  has  approximately  a normal  dis- 
stribution  with  mean  6 and  variance  Q/tp.  If  Cj  >0,  c^  = 0,  and  1 £ p < 3, 
Theorem  3.1  allows  replacement  of  Tp  by  its  asymptotic  equivalent  t*  to  give 


the  variance  of  0(Tp)  asymptotically  equivalent  to  c^^^  6^^*^^^^.  Thus, 
p may  be  chosen  by  the  experimenter  to  obtain  the  asymptotic  variance  of 
e(Tp)  proportional  to  the  desired  power  of  6.  For  c^^  > 0,  Cj  = 0,  0 <.  p < 2, 

A 

the  same  argument  shows  that  the  asvmntotic  variance  of  9(Tp)  is  proportion  to 

el+(p/2)^ 


4.  Applications  to  discrete  time  rules.  Let  H denote  the  class  of  stopping 
times  with  respect  to  (^^.t  > 0}.  In  previous  sections  the  optimal  stopping 
time  in  H under  various  cost  functions  was  derived  and  studied.  In  some 
situations,  one  may  prefer  to  find  the  optimal  stopping  time  in  the  class  DCH, 
where  D is  the  collection  of  all  rules  which  stop  only  at  arrival  times.  More 

precisely,  let  t^  = 0,  and  t.  = inf{t:  X(t)  = i>,  i >1.  Then  a stopping  time  a 

00 

is  in  D if  and  only  if  E P(a=t.)  =1.  In  this  section  the  problem  of  finding 

i=0  ’ 

an  optimal  stopping  time  in  D will  be  discussed  and  nearly  optimal  procedures 
will  be  presented.  Throughout  this  section,  take  Cj  = 0,  c^  > 0 and  p = 0, 

1,  or  2.  This  problem  with  the  above  sampling  costs  but  different  estimation 
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cost  has  been  studied  by  Starr  and  Woodroofe  (1972)  from  a non-Bayesian  point 
of  view.  For  p = 2,  note  that  T2  is  in  D and  hence  is  optimal  In  0 be- 
cause It  is  optimal  in  the  larger  class  H.  There  appears  to  be  no  simple-to- 
use  expression  for  the  optimal  stopping  time  in  D for  p = 0 or  1.  However, 
nearly  optimal  rules  in  D for  this  case  can  be  derived  from  the  optimal  rules 

in  H in  a natural  way.  For  p = 0,  1,  define  a * t / »,  where 

P n(p)’ 

n(p)=inf  {m:  t^^  > Tp}.  It  is  easy  to  see  that 


and 


0 0 0 0 


+ c.. 


Oj  - T, 


Thus,  E(t  ) - infE(r  ) < E(^  ) - E(^) 

p oeD  p p 

< c^(2-p)  for  p = 0,  1, 


Thus,  the  expected  cost  incurred  by  using  Op  instead  of  the  unknown  optimal 
rule  in  D is  at  most  equal  to  the  cost  of  2 - p additional  observations. 


J 
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Appendlx 


Proof  of  Letuna  2.1.  Write 

^p(t,x)  = Hp(t,x)  + c^x  + Cj.t,  where  Hp(t,x)  = B^’^r(ajj+x+l-p)r(aQ+x)"\ 
the  cost  due  to  estimation.  Then,  A^p(t,x)  = AHp(t,x)  + + c.j.,  with 

the  last  two  terms  being  easily  computed  since  Ax  = (aQ+x)B^^.  To  derive  AHp(t,x), 
write 


E(Hp(t+h,X(t+h))-Hp(t,x)|(t,X(t))  = (t,x),e) 

= E(Bj;hr(ajj+x+Y+l-p)r(ajj+x+Yr’-Hp(t,x)|(t,X(t))  = (t,x),e) 
where  Y is  Poisson  (6h)  given  0.  Express  this  expectation  as 
rp,o(0’h»t)  + '^p,i 

rp^jj(0,h,t)  = (bJ;^  - 3P■^)^(a^+l-p)^(a^)■^e■®^ 

rp^l(0,h.t)  = [bJ;^  (a^+2-p)r(a^+l)'''  - B5'^r(a^+l-p)r(a^)"'’ ]0he'®^, 

" I [65;hr(VK+l-p)r(a^+K)'''  - 0j"^r(a^.+l-p)r(a^)‘^]^  e'®^. 
K—Z 

I (t,X(t))  = (t,x)]  will  be  shown  to  be  o(h).  Define 
P*^ 
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d|^{t)  = TT^  (a^-P+i)(a^-l+i)~\ 

K -K 

and  note  that  d|^(t)  < 1 for  p > 1,  and  that  dj^(t)  £ (o^-p+1)  for  0 < p < 1 
In  terms  of  d|^(t), 

!ip_2(9.h.t)  = (8P;2-BP-^)r(ap-p*I)r(ap)-’dn(t)%lV«\ 

Thus,  for  p ^ 1 , 

1 r(a^-p+I)r(a^)’^lB5;J-Bf‘^|e‘®^e®^l-0h| 

which  is  o(h)  as  h + 0,  and  is  also  dominated  by  an  integrable  function  of  0 
for  h in  a neighborhood  of  zero.  Thus,  E(R  9(0,h,t) | (t,X(t))  = (t,x))  = o(h) 
Likewise,  for  0 £ p < 1, 

0h(a.-p+l)a"^ 

+ |e  -l-0h(a^-p+l )a^  [] 

which  yields  E(R^  «(9,h,t) ( (t,X(t) ) = (t,x))  = o(h). 

Finally,  as  h I'  0, 
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n-7  -1 

^ (p-2)b5  ^r(a^+l-p)r(ot^)  \ 


P - -h — bJ  ^(r(a^+2-p)r(a^+i)  '-r(a^+i-p)r(o^)’Me. 


Using  E[0|(t,X(t))  = (t,x)]  = (aQ+x)6^\  and  noting  that  the  limit  and  ex- 
pectation can  be  interchanged  yields 


h"^E(rp^Q+rp^^|(t,X(t))  = (t,x)) 


-BP‘^r(a^+l-p)r(a^)"’  as  h 4-  0 


completing  the  proof. 
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ST R ACT  ^Concfnua  on  rovorao  aid*  II  nacaoonfY'^and  Idmnttry  by  block  numbmr) 

Conditional  on  the  value  of  9,  9 > 0,  let  X(t) , t > 0,  be  a homogteneous 
Poisson  process.  Sequential  estimation  procedures  of  the  form  (o»S(a))  ate  con- 
sidered. To  measure  loss  due  to  estimation,  a family  of  functions,  indexed  by 
p,  is  used:  L(9,§)  = /9“P (9-§) ^ , and  the  cost  of  sampling  involves  cost  per 
arrival  and  cost  per  unit  time.  The  notion  of  ^monotone  case**^for  total  cost 
functions  of  a continuous  time  process  is  defined  in  terms  of  the  characteris- 
tic operator  of  the  process  at  the  total  cost  function.  The  Bayes'  sequential 
procedure  is  then  derived  for  those  cost  functions  in  the  monotone  case  using  , 
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extensions  of  Dynkin's  identity  for  the  characteristic  operator, 
ties  of  these  procedures  are  studied  as  sampling  costs  tend  to  0 
cedures  are  then  modified  and  compared  with  procedures  which  are 
all  stopping  rules  terminating  at  arrivals. 
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